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Description 

Background of the Invention 
5 Field of the Invention 

[0001] This invention relates to a data processing circuit, a microcomputer incorporating this data processing circuit, 
and electronic equipment that is constructed by using this microcomputer. 

10 Description of Related Art 

[0002] In the prior art a RISC microcomputer that is capable of manipulating 32-bit data uses fixed-length instructions 
that are 32 bits wide. This is because the use of fixed-length instructions enables a reduction in the time required for 
decoding the instructions in comparison with the use of variable-length instructions, and it also makes it possible to 
15 reduce the size of the circuitry of tne microcomputer. 

[0003] However, even with a 32-bit microcomputer, it isn't always necessary to use the full 32 bits. Thus, if all of the 
instructions are written as 32-bit instructions, many of them will contain unused portions so that memory will be used 
inefficiently. 

[0004] The present inventors have investigated the idea of a microcomputer that handles fixed-length instructions 
20 of a bit width that is narrower than that of the executable data, to improve the usage efficiency without making the 
control circuitry more complicated. 

[0005] However, simply cutting 32-brt fixed-length instructions to fit into a fixed 16-bit length, for example, causes 
problems as described below. 

[0006] An important feature of a RISC microcomputer is the use of general-purpose registers, whose contents have 
25 no preassigned meaning. For that reason, when a stack pointer is used, one of the general-purpose register is used 
as the stack pointer and an instruction that acts upon a general-purpose register is used to implement stack operation. 
[0007] When coding a process that takes data specified by a memory address obtained by adding a predetermined 
offset to the stack pointer and transfers it to a predetermined register, using an instruction that acts upon a general- 
purpose register, for example, it is necessary to include within the object code of that instruction the offset, data spec- 
30 ifying the predetermined register, and data specifying the register to be used as the stack pointer 

[0008] Thus, if an instruction that acts upon a general-purpose register is used when coding a process that acts upon 
the stack pointer, a large amount of data has to be specified in the object code, making rt difficult to code the details 
of the instruction within a 16-brt fixed-length. Increasing the instruction length to 32 bits, for example, would result in 
a large number of instructions that do not particularly need the entire 32 bits, so that many instructions will contain 
35 unused portions, leading to a decrease in the efficiency with which memory is used. 

[0009] In addition, increasing the instruction length will also increase the amount of memory required for storing such 
instructions, so it is preferable from the point of view of efficient use of memory to not only use fixed-length instructions, 
but also make the instructions as short as possible. 

[0010] When a program is executed that was written in a language that secures a storage region for auto-variables 
40 linked to the stack pointer, such as programming language C, there are many instructions that act upon the stack 
pointer, so it is preferable to have efficient coding and execution of instructions that act upon the stack pointer. 
[0011] In that case, it is therefore preferable to have an architecture that enables the coding and execution of in- 
structions that are as short as possible, when executing a process that acts upon the stack pointer. 
[0012] It has become common recently to incorporate general-purpose registers internally, particularly with a RISC 
45 CPU, to increase capabilities. Providing a large number of registers internally makes it possible for many processes 
to be executed rapidly within the CPU itself, without having to access memory. If such a large number of internal 
registers is provided, a large number of registers have to be saved during the process of register save and restoration 
when an interrupt is processed or a subroutine is called. 

[0013] The description now turns to a prior-art example of instructions that save and restore the contents of registers, 
50 frequently used when entering and leaving a subroutine, even within stack-related instructions. 

[0014] The instruction set of a microcomputer usually has instructions for saving and restoring the contents of reg- 
isters in the CPU to and from a stack provided in memory. These are either dedicated instructions or instructions that 
address registers indirectly. 

[0015] In the Intel 80386 chip, push, pusha, and pushad instructions are available for writing the contents of registers 
55 to the stack, and pop, popa, and popad instructions are available for returning data from the stack to registers. 

[0016] When the contents of a register are written to the stack by the push instruction, the register has to be specified 
as an operand, such as 'push EAX. 1 This example concerns a 32-bit register EAX. If the contents of registers EAX, 
ECX, EDX, and EBX are all to be written to the stack, this push instruction must be repeated, as follows: 
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push EAX 
push ECX 
push EDX 
push EBX 

5 

[0017] If such push and pop instructions are used for each of the registers, the size of the object code increases and 
a large number of program execution steps is required, leading to delays in the execution time or processing of the 
program. 

[0018] That is why the pusha or pushad instruction is used to write the contents of all eight of the general-purpose 
10 registers of the 80386 to the stack, pusha operates on the lower 16 bits of each of the eight registers, and pushad 
operates on the entire 32 bits. The use of pusha and pushad can remove the need to repeat the push instruction eight 
times. 

[0019] The pop, popa, and popad instructions act in a similar manner 

[0020] Disadvantages of repeating the push instruction include the increased length of program code and the delays 
is in execution caused by executing a fetch for each instruction. From this point of view, the pusha and pushad instructions 
provide a large improvement when writing the contents of all eight registers to the stack. However, this is not an ad- 
vantage when writing fewer than the eight registers, such as four or six registers. 

[0021] In other words, if the pusha, pushad, popa, and popad instructions of the 80386 operate upon all of the reg- 
isters, instructions with delayed cycle lengths must be used when there is no need to save and restore the contents 
20 of all of the registers. In such a case, one instruction will suffice, but this gives rise to a problem in that the execution 
cycle of that instruction takes too long. 

[0022] Similar saving and restoration of the program counter is necessary when program flow branches to a sub- 
routine and returns from the called routine, using instructions such as the call instruction and ret instruction. With a 
RISC CPU of the prior art, these processes are implemented by software. In other words, the program counter is saved 
25 and restored by executing assembler instructions (object code) that declare these processes. This leads to an increase 
in object code of the call and ret instructions, and the execution of a fetch for each instruction makes the execution 
speed sluggish. 

[0023] The document J.F. WAKERLY: "Micro-computer Architecture and Programming", 1981, JOHN WILEY & 
SONS, INC. NEW YORK, US XP-002084219 describes the MOTOROLA 68000 microprocessor architecture that in- 
30 eludes features corresponding to those in the pre-characterizing portion of claims 1 and 2. WO 9608767 A discloses 
a microcontroller system capable of transferring data between a memory stack and a pluralfty of general-purpose 
registers listed in an instruction in a single instruction cycle. A single instructions causing data for a plurality of registers 
to be pushed or poped to or from the stack include, inter alia, a binary list of the registers involved. 

35 Summary of the Invention 

[0024] An objective of this invention is to provide a data processing circuit, microcomputer, and electronic equipment 
having an architecture that enables efficient coding of processes that act upon the stack pointer within a short instruction 
length, for execution. 

40 [0025] Another objective of this invention is to provide a data processing circuit, microcomputer, and electronic equip- 
ment that enables efficient coding of processes that save and restore the contents of registers, with fast processing of 
interrupts and subroutine calls and returns. 

[0026] These objects are achieved by a data processing circuit as daimed in claims 1 and 2 and the preferred em- 
bodiments defined in the dependent claims. 
45 [0027] In accordance with the invention it is made possible to implement the saving to the stack of the contents of 
a plurality of registers specified by linked sequential values and the restoration of data to a plurality of registers specified 
by linked sequential values, using only a count means and a simple sequence controller. Since this makes it possible 
to provide a data processing circuit with a small number of gates, this invention can be applied to one-chip microcom- 
puters or the like. 

50 [0028] When there are plural general-purpose registers, there are usually addresses for specifying these registers. 
A preferred embodiment of the invention specifies these registers by register number, in continuous sequence from 0 
ton-1. 

[0029] This aspect of the invention makes it possible to execute at least one of a push or pop of data between memory 
and a plurality of continuous registers from register 0 to the register with a final register number, by specifying any 
55 register number as the final register number. Therefore, the contents of registers can be saved and restored efficiency 
during the execution of a program having a structure that uses registers in sequence from register number 0. 
[0030] According one embodiment, a subroutine could be such as an interrupt processing, exception processing, or 
debugging processing routine. Therefore, an instruction for branching to a subroutine could be such as an instruction 
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that calls a subroutine or the like, or a software interrupt instruction or software debugging interrupt instruction for 
branching to an interrupt processing, exception processing, or debugging processing routine. Similarly, a return in- 
struction from the subroutine could be a return instruction such as from an interrupt processing, exception processing, 
and debugging processing routine. 
5 [0031] It is usually necessary to save and restore the program counter when branching to a subroutine or returning 
from a subroutine. 

[0032] This embodiment makes it possible to save and restore the program counter simultaneously with the execution 
of the instruction for branching to the subroutine and the instruction for returning from the subroutine. In other words, 
the data processing circuit of this aspect of the invention has a circuit structure that enables the saving and restoration 

10 of the program counter by either one of instruction, within an instruction for branching to a subroutine and a return 
instruction from the subroutine. There is therefore no need for instructions for the saving and restoration of the program 
counter, which is necessary when branching to a subroutine and returning therefrom, enabling a reduction in the number 
of instructions. This makes it possible to increase the speed of processing during a branch to another routine, such as 
subroutine call and return, without wasting cycles. 

15 [0033] When a software interrupt instruction is generated, for example, it is necessary to save and restore the proc- 
essor status register that holds the current state of the CPU or other data processing circuit. It is therefore preferable 
to perform this saving and restoration of the processor status register simultaneously with the execution of an instruction 
such as a software interrupt instruction. According to another embodiment, the data processing circuit uses instructions 
of a RISC. 

20 [0034] A RISC data processing circuit is designed with the objectives of reducing the dimensions of the hardware 
and increasing the speed thereof. Thus, a RISC data circuit has many general-purpose registers. Reduction of the 
number of instructions is achieved by using only an instruction set which has universality. 

[0035] Therefore, with a RISC data processing circuit, the stack pointer is allocated to a general-purpose register, 
and the processing uses an instruction set that acts upon this general-purpose register when manipulating the stack 
25 pointer. However, this method leads to an increased instruction length, so that memory is used inefficiently. 

[0036] This aspect of the invention makes it possible to increase the efficiency with which memory is used by a RISC 
data processing circuit, by reducing the instruction length. 

[0037] When the use of fixed-length instructions is compared with the use of variable-length instructions, it is possible 
to shorten time required for decoding the instructions and reduce the dimensions of the data processing circuit itself. 
30 To ensure that memory is used efficiently when fixed-length instructions are employed, by avoiding the creation of 
unused portions within instruction, it is preferable to have little variation in the number of bits necessary for each 
instruction, so that they are as short as possible. 

[0038] Another embodiment of the invention makes it possible to reduce the length of instructions that act upon the 
stack pointer and thus tend to become too long, tt is therefore possible to avoid the creation of unused portions within 
35 instructions, even when fixed-length instructions are employed, so that memory can be used more efficiently. 

Brief Description of the Drawings 

[0039] 

40 

Fig. 1 is a schematic view of the circuit structure of a CPU in accordance with a first embodiment of 

this invention; 

Fig. 2 shows a register set of the CPU of this embodiment; 

Fig. 3 is a view illustrative of the basic operation of the stack pointer; 

45 Fig. 4 shows an example of the usage of a dedicated stack pointer instruction, as well as the usage 

state of the stack provided in memory and the state of the stack pointer; 

Fig. 5 is a view illustrative of the process of transferring auto-variables in memory to registers; 

Figs. 6A and 6B show the bit fields of an SP-relative load instruction and a general-purpose load instruction; 

Fig. 7 is a flowchart illustrative of the operation during the SP-relative load instruction for reading word 

50 data; 

Fig. 8 is a flowchart illustrative of the operation during the SP-relative load instruction for writing word 

data; 

Figs. 9A to 9F show views illustrative of the usage state of the stack in memory and the state of the stack pointer 

during each routine of an executing program that extends over a plurality of routines; 
55 Figs. 10A and 10B show the bit fields of a stack pointer move instruction and a general immediate data arithmetic 
instruction; 

Fig. 11 is a flowchart illustrative of the operation during the add stack pointer move instruction; 

Fig. 12 is a flowchart illustrative of the operation during the subtract stack pointer move instruction; 
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Fig. 13 is a view illustrative of program execution control during the call and ret instructions; 

Fig. 14 shows the bits field of an PC-relative subroutine call instruction; 

Fig. 15 is a flowchart illustrative of the operation during the PC-relative subroutine call instruction; 

Fig. 16 is a flowchart illustrative of the operation during the ret instruction; 

5 Figs. 1 7A and 1 7B schematically show the actions during the execution of push instructions; 

Figs. 1 8A and 1 8B schematically show the actions during the execution of pop instructions; 

Fig. 19 shows the bit structure of the pushn and popn instructions; 

Fig. 20 is a block diagram of the hardware configuration required for executing the sequential push 

instruction (pushn) or sequential pop instruction (popn); 

10 Fig. 21 is a flowchart illustrative of the operation during the pushn instruction; 

Fig. 22 is a flowchart illustrative of the operation during the popn instruction; and 

Fig. 23 is a block diagram of the hardware of a microcomputer in accord a nee with a second embodiment 

of this invention. 



15 Description of Preferred Embodiments 

[0040] Embodiments of this invention are described below with reference to the accompanying drawings. 
Embodiment 1 

20 

(1) Configuration of the CPU of this Embodiment 

[0041] The CPU of this embodiment executes virtually all of its instructions within one cycle, thanks to its pipeline 
and load/store architecture. All of the instructions are expressed as 16-bit fixed-length instructions, so that instructions 

25 processed by the CPU of this embodiment are implemented within compact object code. 

[0042] In particular, this CPU is configured in such a manner that it has a dedicated stack pointer register to enable 
the efficient coding and execution of processes that act upon the stack pointer. It also enables the decoding and exe- 
cution of an instruction set of a group of dedicated stack pointer instructions that have object code specifying that 
dedicated stack pointer register as an implicit operand. 

30 [0043] A schematic view of the circuit structure of the CPU of this embodiment is snown in Fig. 1 byway of illustration. 
[0044] This CPU 10 comprises a register set that includes general-purpose registers 11, a PC register 12 for holding 
the program counter, PSR (processor status register) 13, and a dedicated stack pointer register SP 14; an instruction 
decoder 20; an immediate data generator 22; an address adder 30; an ALU 40; a PC incrementer 44; various internal 
buses 72, 74, 76, and 78; and various internal signal lines 82, 84, 86, and 88. 

35 [0045] The instruction decoder 20 decodes object code that has been input, performs the processing necessary for 
the execution of the resultant instruction, and outputs any necessary control signals. Note that this instruction decoder 
20 also functions as a decoding means for decoding the object code of the above mentioned dedicated stack pointer 
instruction and outputting control signals on the basis of that instruction. 

[0046] The immediate data generator 22 generates 32-bit immediate data that will be used during execution, on the 

40 basis of immediate data comprised within the object code, and generates the 0, • 1, • 2, and • 4 constant data that is 
necessary for the execution of each instruction. The PC incrementer 44 updates the program counter stored in the PC 
12, on the basis of the execution cycles of the instructions. The address adder 30 adds the data that is stored in the 
various registers and the immediate data generated by the immediate data generator 22 to generate the address data 
required during the reading of data from memory. The ALU 40 performs numerical calculations and logical operations. 

45 [0047] This CPU also comprises internal bus and signal lines. The functions of a PA_BUS 72 and a PB_BUS 74 
include the transfer of input signals for the ALU 40. The functions of a WW_BUS 76 include the fetching of calculation 
results from the ALU 40, to transfer them to the general-purpose registers. The functions of an XA_BUS 78 include 
the transfer of address data that has been fetched from the general-purpose registers 11 and the SP 14. An I A signal 
line 82 transfers address data from the various components within the CPU to an external l_ADDR_BUS 92. A DA 

50 signal line 84 transfers address data from the various components within the CPU to an external D_ADDR_BUS 96. 
A DIN signal line 86 transfers data from an external D_DATA_BUS 98 to the various components within the CPU. A 
DOUT signal line 88 transfers data from the various components within the CPU to the external D_DATA_BUS 98. An 
IA multiplexer 83 switches between signals that are to be output to the IA signal line 82 (signals on the PA_BUS 72, 
on the WW_BUS 76, from the PC 12, and a value that is the PC plus 2). A DOUT multiplexer 89 switches between 

55 signals that are to be output to the DOUT signal line 88 (signals on the PA_BUS 72, on the WW_BUS 76, from the PC 
12, and the value that is the PC plus 2). 

[0048] Since the components of the CPU 1 0 execute instructions on the basis of control signals that are output from 
the instruction decoder 20, they also function as means for executing a group of dedicated stack pointer instructions 
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on the basis of the above described control signals and the contents of the dedicated stack pointer register. 
[0049] This CPU 10 transfers signals to and from the exterior through a 16-bit instruction data bus (l_DATA_BUS) 
94 ( an instruction address bus (l_ADDR_BUS) 92 for instruction data access, the 32-bit data bus (D_DATA_BUS) 98, 
the data address bus (D_ADDR_BUS) 96 for data access, and a control bus (not shown in the figure) for control signals. 

5 

(2) Register Set of the CPU of this Embodiment 

[0050] Essential portions of the concept of the register set of the CPU of this embodiment will now be described. 
[0051] The register set of the CPU of this embodiment is shown in Fig. 2. This CPU has a register set that comprises 
10 the 16 general-purpose registers 11, the PC 12, the PSR 13, the SP 14, ALR (arithmetic low register) 15, and AHR 
(arithmetic high register) 16. 

[0052] The general-purpose registers 11 are functionally equivalent to 32-bit registers and are labeled R0 to R15. 
These general-purpose registers 11 are used during data calculations and address computations. 
[0053] The PC 12 is a 32-brt-long incremental counter that holds a program counter indicating the address of the 
15 currently executing instruction, in this document, "PC" is used when referring to the register itself and "program counter" 
is used when referring to the value stored within the PC. 

[0054] The PC- 12 cannot be accessed directly by instructions such as the load instruction. With a call or int instruc- 
tion, or when an interrupt or exception occurs, the program counter is read from the PC 12 and saved to the stack. In 
this manner, a jump destination address used when a branch instruction is executed can be set in the PC. This also 
20 happens for a branch specified by a conditional branch instruction. The instruction address for the return destination 
is read from the stack by a ret or reti instruction and is replaced in the PC 12. 

[0055] PSR (processor status register) 1 3 is a 32-bit register to which a flag is allocated, and which holds the current 
status of the CPU. When an int instruction, an interrupt, an exception, or the like occurs, the state of the PSR at that 
point is saved to the stack when the flow branches to the corresponding processing routine. Conversely, the execution 

25 of the reti instruction causes the saved value to be replaced in the PSR. 

[0056] The SP 14 is a 32-bit dedicated stack pointer register containing the stack pointer that indicates the higher 
address of the stack. In this document, SP is used when referring to the register itself and stack pointer is used when 
referring to the value stored within the SP. Note, however, that since the stack pointer always points to a word boundary, 
the lowermost two bits of the stack pointer are always zero. 

30 [0057] This stack pointer is updated at the generation of a trap or at the execution of one of a group of dedicated 
stack pointer instructions that are provided by this embodiment 

[0058] Examples of these dedicated stack pointer instructions that update the stack pointer include instructions that 
branch to another routine, sucn as the call and ret instructions, a stack pointer move instruction, a pushn instruction, 
and a popn instruction. For instance, when the call instruction is executed, the stack pointer is first decremented by 
35 an amount equivalent to the word size (-4), then the PC 12 is saved to the stack. Conversely, when the ret instruction 
is executed, the destination address for return is loaded into the PC from the stack, and the stack pointer is incremented 
by an amount equivalent to the word size (+4). When an int instruction is executed, or when an interrupt or exception 
occurs, the value in the PC or PSR is saved to the stack by the following procedure: 

40 i. sP = SP-4 

2. The PC is saved at the higher address of the stack, indicated by the stack pointer 

3. sP = SP-4 

4. The PSR is saved at the higher address of the stack, indicated by the stack pointer. 

45 [0059] When the reti instruction is executed, the above process is reversed to restore the CPU to its original state. 
Thus the execution of the call, ret, or int instruction causes the stack pointer to be updated as appropriate for that 
execution. Details of each instruction will be given later. 

[0060] A trap function provided by this embodiment comprises interrupts that are generated asynchronously with the 
execution of instructions and exceptions generated by the execution of instructions. When a trap is generated, the 

50 CPU saves PC (program counter) and PSR (process status register) to the stack, then reads a vector table from a trap 
table and branches to a routine corresponding to that trap. Concomitant with the generation of the trap, IE (interrupt 
enable) bit is cleared and the generation of subsequent maskable interrupts is inhibited. Maskable external interrupts 
are enabled again by using a load instruction with respect to the PSR to write 1 to the IE bit. 
[0061] The reti instruction is used to return from the trap processing routine to the original routine. When the reti 

55 instruction is executed, the CPU reads the PSR and PC from the stack in that order, restores the PSR to its original 
value and also branches to the return address. Note that exceptions include debugging exceptions, address misalign- 
ment exceptions, overflow exceptions, and zero-division exceptions. 

[0062] Detailed descriptions of ALR (arithmetic low register) 15 and ALH (arithmetic high register) 16 are omitted. 
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[0063] The special registers PSR 13, SP 14, ALR 15, and AHR 16 of the CPU are capable of transferring data to 
and from the general-purpose registers, using load instruction. Each register has a special register number and is 
accessed by using that number 



Special Register Name 


Special Register Number 


Assembler Mnemonic 


Process status register 


0 


%PSR 


SP 


1 


%SP 


Arithmetic low register 


2 


%ALR 


Arithmetic high register 


3 


%AHR 



(3) Description of Stack and Stack Pointer 

[0064] The stack is a temporary storage region provided in memory. It is a contiguous region in which data is written 
15 from the bottom up, as if stacked in a rack. The stack pointer indicates the address of the data at the top of the stack, 
that is, the data that was written most recently into the stack. 

[0065] The basic operation of the stack pointer will now be described with reference to Fig. 3. 
[0066] Reference number 1 00 in Fig. 3 denotes a stack region provided in memory. A hatched portion 1 02 represents 
data that was most recently stored in this region, and a memory address 1000 indicates that data. Note that a region 
20 104 below the hatched portion 102 contains data that has already been stored and a portion 106 above the hatched 
portion 102 is available for storing data in the future. 

[0067] The stack pointer always indicates a word boundary. This means that, when data is written to the stack, the 
stack pointer within the SP 14 moves upward by 4 and the data is stored at the location indicated by that stack pointer. 
When data stored in the stack is fetched, it is the data at the address indicated by the current SP 14 that is fetched 
25 and the stack pointer stored in the SP 14 moves downward by 4. In this manner, the stack pointer always indicates 
the storage address of the data that was stored most recently in the stack. 

(4) Description of Group of Dedicated Stack Pointer Instructions 

30 [0068] A RISC CPU always uses a general-purpose register as the stack pointer. This embodiment has the SP 14, 
which is a dedicated stack pointer register, which is operated upon by the previously mentioned group of dedicated 
stack pointer instructions. 

[0069] This group of dedicated stack pointer instructions is a generic name for a number of instructions that access 
the SP 14 as an implicit operand and perform operations on the SP 1 4. The group of dedicated stack pointer instructions 
35 comprises instructions that branch to other routines, such as SP-relative load instructions (Id, etc.), stack pointer move 
instructions (aad, sub), instructions for branching to subroutines (call, etc.), and return instructions (ret, etc.); a se- 
quential push instruction (pushn); and a sequential pop instruction (popn). 

[0070] What these instructions have in common is that they are dedicated stack pointer instructions so there is no 
need for data specifying the stack pointer in the object code. Another common feature is that they enable the efficient 
40 coding in short instructions of processes that use the stack pointer. 

[0071] The use of these instructions makes it possible to efficiently process data that is stored in a stack provided 
in memory. Interrupt and subroutine call/return processes can also be performed efficiently. 

[0072] An example of the usage of these dedicated stack pointer instructions when a subroutine is called will now 
be described with reference to Fig. 4. This figure also shows the usage state of the stack provided in memory and the 

45 state of the stack pointer. A MAIN program 500 and a subroutine 520 of Fig. 4 are written as object code created by a ) 
C compiler. Reference number 540 denotes the state of a stack in memory. Reference number 502 indicates that 
processing is being performed, using the general-purpose registers R0 to R3. Reference number 506 denotes a sub- 
routine call instruction. Before the subroutine call instruction is executed by the MAIN program 500, in other words, 
after the previous instruction indicated by 504 has been executed, the stack pointer (SP) points to an address <1> in 

so the stack in memory. When this subroutine call instruction is executed, control passes to the subroutine 520. At this 
point, an instruction for branching to a subroutine (call instruction), which is one of the dedicated stack pointer instruc- 
tions of this embodiment, is executed. When this instruction is executed, the value of the stack pointer (SP) is auto- 
matically incremented by -4 (at <2> in Fig. 4) and the address for return to the MAIN program is stored in an area 544 
of the stack that is indicated by the stack pointer <2>. 

55 [0073] When the subroutine 520 starts executing, it first transfers the values that are stored in the general-purpose 
registers R0 to R3 used by the MAIN program. Reference number 524 denotes a sequential push instruction (pushn) 
which is one of the dedicated stack pointer instructions that saves the values stored in the general-purpose registers 
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R0 to R3 to the stack. When this instruction is executed, the values stored in the general-purpose registers R0 to R3 
are transferred in sequence to the stack, so that they are stored in the stack 540 as shown at 550 in Fig. 4. When the 
execution of this process ends (524 <3>), the stack pointer (SP) is at 546 (<3> SP). 

[0074] The subroutine 520 then secures an auto-variable region to be used by the subroutine. An add instruction 
5 denoted by 526 is a stack pointer move instruction which is a dedicated stack pointer instruction that causes the stack 
pointer to move upward to secure a stack region to be used by the subroutine 520. When this instruction is executed, 
the stack pointer (SP) moves upward by X bytes to a location 548 (<4> SP) to secure an auto-variable region to be 
used by the subroutine, as stated in Note 2. 

[0075] Reference number 528 denotes processing that uses auto-variables and the general-purpose registers R0 
10 to R3 within the subroutine 520. At this point, the location of the stack pointer is as indicated by <5> and the loading 
of the auto-variables is performed by using a dedicated SP load instruction that is one of the dedicated stack pointer 
instructions. 

[0076] Reference number 529 denotes this SP load instruction that transfers an auto-variable S1 that is stored in 
memory to the general-purpose register R1. This auto-variable S1 is stored at a location that is offset by Y bytes from 

15 the stack pointer (<5> SP). The stack pointer does not move during the above processing of the subroutine 528, so 
the memory address of the auto-variable is specified by the stack pointer plus the offset This makes it possible to use 
the dedicated SP load instruction to fetch data to and from the general-purpose registers efficiently. 
[0077] Before control returns from the subroutine 520 to the MAIN program 500, the values in the general-purpose 
registers R0 to R3 that were saved to the stack must be restored to the general-purpose registers R0 to R3 and the 

20 stack pointer must be reset to indicate the area 544 that contains the address for return to the MAIN routine. This 
restores the stack pointer that was moved by the stack pointer move instruction of 526 back to its original location. A 
sub instruction indicated by 530 is a stack pointer move instruction which is a dedicated stack pointer instruction that 
moves the stack pointer downward. When this instruction is executed, the stack pointer moves to 546 (<6> SP). The 
data 550 stored in the stack is restored to the general-purpose registers R0 to R3. Reference number 532 denotes a 

25 sequential pop instruction (popn) which is a dedicated stack pointer instruction that transfers data in the stack to the 
general-purpose registers R0 to R3. When this instruction is executed, the values 550 stored in the stack are sequen- 
tially transferred to the general-purpose registers R0 to R3 so that they are stored in the stack 540 as shown at 550 
in Fig. 4 When the execution of this process ends (532 <7>), the stack pointer indicates 544 (<7> SP). 
[0078] Reference number 534 denotes a return instruction. When this return instruction is executed control passes 

30 to the MAIN program. In this case, a return instruction (ret instruction) which is a dedicated stack pointer instruction is 
executed by this embodiment. When this instruction is executed, control branches to an instruction that indicates the 
address for return to the MAIN program, which is stored in the stack area indicated by <7> SR In other words, the 
MAIN program 500 returns to the next instruction 507. The value of the stack pointer is automatically incremented by 
+4 to move it to the top area of the stack used by the MAIN program 500 (<8> SP in Fig. 4). 

35 [0079] . The description now turns to details of each of the dedicated stack pointer instructions, as well as the circuit 
structure required for executing these instructions and the operation during such execution. 

(5) Stack Pointer (SP) Relative Load Instructions 

40 [0080] A C compiler creates object code that specifies that an auto-variable region is to be linked to the stack pointer. 
More specifically, the C compiler acts as means for securing an auto-variable region that is Y bytes long, at an offset 
of X from the stack pointer. 

[0081] A view used for illustrating the process of transferring auto-variables in memory to registers is shown in Fig. 
5. In this figure, auto-variable a is word data, auto-variables b and c are each half-word data, and auto-variables d to 

45 g are each byte data. The stack pointer stored in the SP indicates 1000, which is the address of the top region of a 
region 600 of the stack in which auto-variables are stored. An area of the stack at the memory address indicated by 
the stack pointer is secured as an area for holding the auto-variable a. Areas in the stack at memory addresses indicated 
by the stack pointer plus 2 and the stack pointer plus 4 are secured as areas for holding the auto-variables b and c, 
respectively. Similarly, areas in the stack at memory addresses indicated by the stack pointer plus 5, the stack pointer 

so plus 6, the stack pointer plus 7, and the stack pointer plus 8 are secured as areas for holding the auto-variables d to 
g, respectively. During the execution of a given process, it may be necessary to transfer data between a general- 
purpose register and an auto-variable which is specified by a memory address that is the stack pointer plus an offset. 
[0082] To enable the CPU of this embodiment to execute the above transfer processing efficiently with shorter object 
code, the following instruction set is provided as SP-relative load instructions that are some of the dedicated stack 

55 pointer instructions: 

Id.b %Rd, [%sp+imm6] (1) 
ld.ub%Rd, [%sp+imm6] (2) 
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Id.h %Rd, [%sp+imm7) 
Id.uh %Rd, [%sp+imm7] 
Id . w %Rd, [%sp+imm8] 
Id.b [%sp+imm6], %Rs 
5 Id.h [%sp+imm7], %Rs 
Id w [%sp+imm8], %Rs 

Instructions (1 ) to (8) are instruction codes created by an assembler. Instruction ( 1 ) sign-expands byte data and trans- 
fers it from the stack to a register, instruction (2) zero-expands byte data and transfers it from the stack to a register, 
10 instruction (3) sign-expands half-word data and transfers it from the stack to a register, instruction (4) zero-expands 
half-word data and transfers it from the stack to a register, instruction (5) transfers word data from the stack to a register, 
instruction (6) transfers byte data from a register to the stack, instruction (7) transfers half-word data from a register 
to the stack, and instruction (8) transfers word data from a register to the stack. 

[0083] The operands [%sp+imm6], [%sp+imm7], and [%sp+imm8] each represent immediate offset data. A memory 

15 address is created by adding an offset that is created during the execution of the instruction on the basis of this imme- 
diate offset data and the value of the stack pointer stored in the SP 14. [%sp+imm6] is immediate offset data used 
when the size of the data in memory is bytes, [%sp+imm7] is that when the data size is half-words, and [%sp+imm8] 
is that when the data size is words. Whichever size of data is used, it is written as 6-bit immediate offset data 614 in 
the object code, as will be described later with reference to Fig. 6A. If imm7 is used (in other words, if the data size is 

20 half-words), however, the immediate offset data 614 is shifted one bit to the left during the execution of the instruction, 
to generate an offset to be added to the stack pointer Similarly, if imm8 is used (in other words, if the data size is 
words), the immediate offset data 614 is shifted two bits to the left, to generate an offset to be added to the stack pointer 
[0084] In this case, the stack is a temporary storage region provided in memory and the above memory addresses 
can be used to specify the locations of, for example, auto-variables (a to g) within the stack, as shown in Fig. 5. 

25 [0085] An example of a bit field 610 of these SP-relative load instructions (1) to (8) is shown in Fig. 6A. The SP- 
relative load instruction of Fig. 6A has 16 bits of object code comprising op code 612 (6 bits) indicating that the operating 
function is a transfer of data between memory and a general-purpose register, immediate offset data 614 (6 bits) 
specified by immediate data, and a register number 61 6 (4 bits) specifying the register that is to be used for the transfer 
The op code 612 comprises common code indicating thatthis is a load instruction that acts upon the SP 14 and different 

30 codes specifying each of data size, sign expansion, and zero expansion. Since it is therefore clear from the op code 
that these instructions act upon the stack pointer stored in the SP 14, there is no necessity to specify data relating to 
the stack pointer in the operands of the object code. The immediate offset data 614 is used for creating an offset from 
the stack pointer for data that is to be transferred. This is specified in 6 bits, regardless of the data size, as described 
above. With instructions (1)to (5), the address (register number) 616 of the register that is involved in the transfer will 

35 contain the number of the register to contain data that has been read out from the stack; with instructions (6) to (8), it 
will contain the number of the register that currently contains data to be written to the stack. 

[0086] By way of comparison, Fig. 6B shows an example of a bit field 620 of object code of a load instruction that 
acts upon a general-purpose register when another general-purpose register is used as the stack pointer (hereinafter 
called a general-purpose load instruction). . 

40 [0087] The general-purpose load instruction of Fig. 6B has 20 bits of object code comprising op code 622 (6 bits) 
indicating that the operating function is a transfer of data between memory and a general-purpose register, immediate 
offset data 624 (6 bits) specified by immediate data, a first register number 620 (4 bits) specifying the general-purpose 
register to be used as the stack pointer, and a second register number 628 (4 bits) specifying the register that is to be 
used for the transfer With a general-purpose microcomputer, the instruction length is in 8-bit units, which gives 24-bit 

45 or 32-bit instructions. 

[0088] Each of Figs. 6A and 6B shows object code for an instruction used when transferring data between a register 
and a memory address specified by adding an offset to the stack pointer. It is clear from these examples that a SP- 
relative load instruction can be written with shorter object code than a general-purpose load instruction. 
[0089] The description now turns to the configuration required for executing the above instructions and the operations 
so that occur during this execution, taking as examples the instruction (5) that transfers word data from the stack to a 
register (since this instruction reads word data from the stack to a register, it is hereinafter called an SP-relative load 
instruction for reading word data) and the instruction (8) that transfers word data from a register to the stack (since this 
instruction writes word data from a register to the stack, it is hereinafter called an SP-relative load instruction for writing 
word data). 

55 [0090] The hardware configuration necessary for executing the instructions will be described first, using Fig. 1 for 
reference. Each of these instructions is transferred over the l_DATA_BUS 94 from an external memory (ROM) 52 and 
is input to the instruction decoder 20. The instruction is decoded by the instruction decoder 20, which outputs various 
signals (not shown in the figure) necessary for executing the instruction. The immediate data generator 22 performs 
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a leftward logical shift on the immediate offset data 614 in accordance with the data size, generates the offset used 
during the execution of sign expansion or zero expansion, if necessary, and outputs the results to the PB_BUS 74. 
The SP 14 contains the stack pointer, and that value can be output to the XA_BUS 78 which is connected to an input 
of the address adder 30. Another input of the address adder 30 is connected to the PB_BUS 74 that is an output of 
5 the immediate data generator 22. An output (ADDR) of the address adder 30 is connected to the external D_ADDR_BUS 
96 by the DA signal line 84. 

[0091] BCU (bus control unit) 60 controls the input and output of data to and from memory (RAM and ROM) 50 and 
52 (which includes the stack area) and outputs READ and WRITE control signals in accordance with various request 
signals that are output from the CPU (such as signals output to external buses). 
10 [0092] The description first concerns the operation during the execution of an SP-relative load instruction for reading 
word data. 

[0093] When the SP-relative load instruction for reading word data is executed, the value of the stack pointer stored 
in the SP 14 is added to the offset created by the immediate data generator 22 on the basis of the immediate offset 
data 614, to create a memory address for reading. Data is read from memory on the basis of this memory address and 
15 is transferred to the general-purpose register specified by the register number 616 in the object code. 

[0094] A flowchart illustrating the operation during the SP-relative load instruction for reading word data is shown in 
Fig. 7. 

[0095] At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 
(step S210). An offset imm created by the immediate data generator 22 from the immediate offset data is then output 

20 to the PB_BUS 74 (step S212). The address adder 30 adds the value on theXA_BUS 78 and the value on the PB_BUS 
74, then outputs the resultant memory read address (ADDR) over the DA signal line 84 to the D_ADDR_BUS 96 (steps 
S214 and S216). A data read request signal from the CPU to the BCU 60 becomes active and executes an external 
memory read cycle (step S218). In other words, the BCU 60 follows the request signal to control the reading of data 
from memory, using this read address as a memory address, and the output of the data to the D_DATA_BUS 98. The 

25 data on the D_DATA_BUS 98 is output to the WW_BUS 76 over the DIN signal line 86 (step S220). The value on the 
WW_BUS 76 is stored in the register (%Rd) having the register number specified by the 4-bit address 616 of the register 
(Ra/Rs) that is specified in the instruction code for the transfer (step S222). 

[0096] The description now turns to the operation during the execution of an SP-relative load instruction for writing 
word data 

30 [0097] When the SP-relative load instruction for writing word data is executed, the value of the stack pointer stored 
in the SP 14 is added to the offset created by the immediate data generator 22 on the basis of the immediate offset 
data 614, to create a memory address for writing to memory. The data stored in the general-purpose register specified 
by the register number 616 in the object code is transferred to the areas in memory specified by that memory address. 
[0098] A flowchart illustrating the operation during the SP-relative load instruction for writing word data is shown in 

35 Fig. 8. 

[0099] At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 
(step S230). An offset imm created by the immediate data generator 22 from the immediate offset data is then output 
to the PB_BUS 74 (step S232). The address adder 30 adds the value on the XA_BUS 78 and the value on the PB_BUS 
74, then outputs the resultant memory write address (ADDR ) over the DA signal line 84 to the D_ADDR_BUS 96 (steps 

40 S234 and S236). The data stored in the register (%Rd) having the register number specified by the 4-bit address 616 
of the register (Rs/Rd) that is specified in the instruction code for the transfer is output to the PA_BUS 72 (step S238). 
The data on the PA_BUS 72 is output to the D_DATA_BUS 98 over the DOUT signal line 88 (step S240). A data write 
request signal from the CPU to the BCU 60 becomes active and executes an external memory write cycle (step S242). 
In other words, the BCU 60 follows the request signal to control the operation of writing to memory the data that has 

45 been transferred to the D_DATA_BUS 98, using this write address as a memory address. 

(6) Stack Pointer Move Instruction 

[0100] Views used to illustrate the usage state of the stack in memory and the state of the stack pointer during each 
50 routine of a program that extends over a plurality of routines are shown in Figs. 9A to 9F. 

[0101] The states of the stack region in memory and the stack pointer (the value stored in the SP 14) during the 
execution of a given process 'a' 211 of a MAIN routine 210 of Fig. 9A are shown in Fig. 9D. Reference number 222 
denotes a stack region that is to be used by the MAIN routine 210, and the stack pointer indicates a higher address 
232 of the region 222 

55 [0102] A SUB1 routine 212 of Fig. 9B is a subroutine that is called and executed from the MAIN routine 210. The 
states of the stack region in memory and the stack pointer (the value stored in the SP 14) during the execution of a 
given process *b' 213 of the SUB1 routine 212 are shown in Fig. 9E. Reference number 224 denotes a stack regicn 
that is to be used by the SUB1 routine 212, and the stack pointer indicates a higher address 234 of the region 224. 
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[0103] A SUB2 routine 214 shown in Fig. 9C is called ano executed from the SUB1 routine 212. The states of the 
stack region in memory and the stack pointer (the value stored in the SP 14) during the execution of a given process 
'c* 21 5 of the SUB2 routine 214 are shown in Fig. 9F. Reference number 226 denotes a stack region that is to be used 
by the SUB2 routine 214, and the stack pointer indicates a higher address 236 of the region 226. 
5 [0104] When execution extends over a plurality of subroutines in this manner, the stack region used by each routine 
moves, so that the value of the stack pointer is moved as appropriate to the top of the stack region used by each routine. 
[0105] To enable the CPU of this embodiment to execute this movement of the stack pointer efficiently with shorter 
object code, the following instruction set is provided as stack pointer move instructions that are some of the dedicated 
stack pointer instructions: 

10 

add %sp,imm12 (9) 
sub%sp,imm12 (10) 

Instructions (9) and (10) are instruction codes created by an assembler Instruction (9) is an immediate data add in- 
15 struction for the stack pointer stored in the SP 14 and instruction (1 0) is an immediate data subtract instruction for this 
stack pointer The operand imm12 is 32-bit data obtained by shifting the 10-bit immediate data by two bits leftward 
then subjecting it to zero expansion. It is used for ooerations with the stack pointer that is stored in the SP 14. 
[0106] A bit field 630 of the stack pointer move instructions (9) and (10) is shown in Fig. 10A. The stack pointer move 
instruction of Fig. 10A has 16 bits of object code comprising op code 632 (6 bits) indicating whether the move data for 
20 the stack pointer stored in the SP 14 is an addition or subtraction and immediate data 634 (10 bits). This op code 632 
comprises common code indicating that this is a stack pointer move instruction that acts upon the SP 14 and different 
codes specifying either addition or subtraction. Since it is therefore clear from the op code that these instructions 
operated upon the stack pointer stored in tne SP 14, there is no necessity to specify data relating to the stack pointer 
in the operands of the object code. The immediate offset data 634 is used for creating an offset for subtracting from 
25 or adding to the stack pointer. 

[0107] By way of comparison, Fig. 1 0B shows an example of a bit field 640 of object code of an add/subtract instruction 
when a general-purpose register is used as the stack pointer, (hereinafter called a general-purpose calculation instruc- 
tion). 

[0108] The general-purpose calculation instruction of Fig. 10B has 20 bits of object code comprising op code 642 (6 
30 bits) indicating that the operating function is a addition or subtraction of immediate data to or from a value in a general- 
purpose register, immediate data 644 (10 bits), and a register number 646 (4 bits) specifying the register that is to be 
operated upon, with a general-purpose microcomputer, the instruction length is in 8-bit units, which gives 24-bit or 
32-bit instructions. 

[0109] Each of Figs. 1 0A and 10B shows object code for an instruction used when adding or subtracting immediate 
35 data to or from the stack pointer. It is clear from these examples that a stack pointer move instruction can be written 
with shorter object code than a general-purpose calculation instruction. 

[0110] The description now turns to the configuration required for executing me above instructions and the operations 
that occur during this execution, taking as an example the instruction (9) that adds immediate data to the SP (hereinafter 
called an add stack pointer move instruction) and the instruction (10) that subtracts immediate data from the SP (here- 

40 inafter called a subtract stack pointer move instruction). 

[0111] The hardware configuration necessary for executing the instructions will be described first, using Fig. 1 for 
reference. Each of these instructions is transferred over the l_DATA_BUS 94 from an external memory (ROM) 52 and 
is input to the instruction decoder 20. The instruction is decoded by the instruction decoder 20, which outputs various 
signals (not shown in the figure) necessary for executing the instruction. The immediate data generator 22 performs 

45 a logical shift by two bits to the left on the 1 0 bits of the immediate data 634, subjects it to zero expansion, and outputs 
it to the PA_BUS 72. The SP 14 contains the stack pointer, and that value is output to the XA_BUS 78. The XA_BUS 
78 is connected to the PB_BUS 74 which forms an input of the ALU 40. Another input of the ALU 40 is connected to 
the PA_BUS 72 that is an output of the immediate data generator 22. The outout of the ALU 40 is connected to the 
WW_BUS 76. This WW_BUS 76 is connected to the input of the SP. 

50 [01 12] The description first concerns The operation during the execution of the add stack pointer move instruction. 
[0113] When the add stack pointer move instruction is executed, the value of the stack pointer stored in the SP 14 
is added to. the offset created by the immediate data generator 22 on the basis of the immediate offset data 634, to 
create a new stack pointer. That value is stored in the SP 14. 

[01 14] A flowchart illustrating the operation during the add stack pointer move instruction is shown in Fig. 11. 
55 [0115] At the start of execution of this instruction, the stack pointer stored in the SP 1 4 is output to the XA_BUS 78 
(step S250). The data on the XA_BUS 78 is output to the PB_BUS 74 (step S252). Move immediate data imm created 
by the immediate data generator 22 on the basis of the immediate data is output to the PA_BUS 72 (step S254). The 
ALU 40 adds the value on the PB_BUS 74 and the value on the PA_BUS 72, and outputs the result to the WW_BUS 
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76 (step S256). The value on the WW_BUS 76 is then input to the SP 14 (step S258). The description now turns to 
the operation during the execution of the subtract stack pointer move instruction. 

[0116] When the subtract stack pointer move instruction is executed, the move immediate data created by the im- 
mediate data generator 22 on the basis of the immediate data 634 is subtracted from the value of the stack pointer 

5 stored in the SP 14 to create a new stack pointer That value is stored in the SP 14. 

[0117] A flowchart illustrating the operation during the subtract stack pointer move instruction is shown in Fig. 12. 
[01 18] At the start of execution of this instruction, the stack pointer stored in The SP 14 is output to the XA_BUS 78 
(step S260). The data on the XA_BUS 78 is then output to the PB_BUS 74 (step S262). Move immediate data imm 
created by the immediate data generator 22 on the basis of the immediate data is output to tne PA_BUS 72 (step 

10 S264) . The ALU 40 subtracts the value on the PA_BUS 72 from the value on the PB_BUS 74, and outputs the result 
to the WW_BUS 76 (step S266). The value on the WW_BUS 76 is then input to the SP 14 (step S268). 

(7) Branch Instruction 

15 [01 19] A view illustrating program execution control during the call and ret instructions is shown in Fig. 13. During a 
MAIN routine 300 shown in Fig. 13, a call instruction (302) is executed to branch to a subroutine SUB 310, so that 
control passes to the subroutine SUB 310. A ret instruction (312) is placed at the end of the subroutine. When this 
instruction is executed, the flow returns to the next the instruction (304) after the call instruction (302) of the MAIN 
routine 300. Therefore, the program is executed in the sequence <1>, <2>, then <3> in Fig. 13. At the end of the 

20 execution of this subroutine SUB 310, control returns to the MAIN routine 300 and execution is to continue from the 
next instruction (304) after the call instruction (302). This makes it necessary to record the address of the return des- 
tination somewhere, when a branch occurs to the subroutine SUB 310. Forthat reason, the return destination address 
is saved to the stack as shown in Fig. 4 during tne execution of a branch instruction that branches to a subroutine, 
sucn as the call instruction, then the return destination address is returned from the stack to the program counter during 

25 the execution of a branch instruction that returns frcm a subroutine, such as the ret instruction (this process is hereinafter 
called saving and restoring the program counter). 

[0120] With a prior-art RISC CPU, the process of saving and restoring the program counter is dene by software, so 
that it is necessary to provide object code (assembler instructions) for executing this process during the execution of 
a branch instruction such as the call instruction. For example, it is necessary to provide object code (assembler in- 
30 structions) for decrementing tne stack pointer by an amount equivalent to the word size (4) during the execution of the 
call instruction, to place the address of the next instruction after the call instruction in the stack on the basis of the 
value in the program counter. 

[0121] The CPU of this embodiment, however, has a hardware configuration that enables the saving and restoration 
of the program counter when the call or ret instruction is executed. It is therefore not necessary to provide object code 
35 (assembler instructions) that saves and restores the program counter separately from the object code of the call and 
ret instructions. 

[0122] To enable the CPU of this embodiment to execute this process of saving and restoring the program counter 
by a single instruction, the following instruction set is provided as branch instructions that are some of the dedicated 
stack pointer instructions: 

40 

callsign9 (11) 
call%Rb (12) 



Instructions (11) to (17) are instruction codes created by an assembler Instruction (11) is a PC-relative subroutine call 
50 instruction which uses the program counter (PC) as a base address, to branch in a relative manner to a branch desti- 
nation address on the basis of the displacement data sign9 specified as the operand. Instruction (12) is a register 
indirect subroutine call instruction which branches to the branch destination address contained within the register spec- 
ified as the operand. Instruction (13) is an instruction for return from a subroutine. Instruction (14) is an instruction for 
return from an interrupt or exception processing routine. Instruction (15) is an instruction for return from a debugging 
55 processing routine. Instruction (16) is a software interrupt instruction. Instruction (1 7) is a software/debugging interrupt 
instruction. 

[0123] An example of a bit field 650 of the PC-relative subroutine call instruction (11 ) is shown in Fig. 14. The PC- 
relative subroutine call instruction of Fig. 14 has 16 bits of object code comprising op code 652 (8 bits) indicating that - 
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int imm2 
brk 



ret 
reti 
retd 
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the operating function is a call instruction that branches to a subroutine with a branch destination address that is 
specified relatively, using the program counter as a base address, and displacement data sign9 (8 bits) 654 specified 
by immediate data. During the execution of this command, the 8-bit immediate data is shifted logically by one bit to 
the left then is subjected to sign expansion. 
5 [0124] The CPU of this embodiment is capable of saving the program counter to the stack at the execution of the 
call instruction, with only the object code shown in Fig. 14. 

[0125] The description now turns to the configuration required for executing the above instructions and the operations 
that occur during this execution, taking as examples the PC-relative subroutine call instruction (11) as an instruction 
for branching to a subroutine and the return instruction (13) as an instruction for returning from the subroutine. 

10 [0126] The hardware configuration necessary for executing the instructions will be described first, using Fig. 1 for 
reference. Each of these instructions is transferred over the l_DATA_BUS 94 from external memory (ROM) 52 and is 
input to the instruction decoder 20. The instruction is decoded by the instruction decoder 20, which outputs various 
signals (not shown in the figure) necessary for executing the instruction. The immediate data generator 22 performs 
a logical shift by one bit to the left on the displacement data 654, subjects it to sign expansion to create a 32-bit 

15 immediate data displace-ment imm, then outputs the result to the PB_BUS 74. The SP 14 contains the stack pointer, 
and that value is output to the XA_BUS 78 which is connected to an input of the address adder 30. Another input of 
the address adder 30 is connected to the PB_BUS 74 that is an output of the immediate data generator 22. An output 
(ADDR) of the address adder 30 is connected to the external l_ADDR_BUS 92 by the IA signal line 82. 
[0127] The l_ADDR_BUS 92 and l_DATA_BUS 94 are connected to the ROM 52 in which is stored the instruction 

20 object code. The bus control unit (BCU) 60 outputs READ control signals that read this instruction object code from 
the memory (ROM) 52 in accordance with various request signals that are output from the CPU (such as signals output 
to external buses). 

[0128] The description first concerns the operation during the execution of an PC-relative subroutine call instruction. 
[0129] When the PC-relative subroutine call instruction is executed, the value of the program counter stored in the 
25 PC 12 is saved to the stack as described with reference to Fig. 4, and the value of the stack pointer stored in the SP 
14 is decremented by the word size (4). A branch destination address obtained by adding the program counter and 
the 32-brt immediate data displacement is set in the PC 12. 

[0130] A flowchart illustrating the operation during the PC-relative subroutine call instruction is shown in Fig. 15. 
[0131] At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 

30 (step S270). This forms one input of the address adder, and constant data (-4) created by the immediate data generator 
22 forms the other input of the address adder 30. The constant -4 is added to the value of the stack pointer on the 
XA_BUS 78 by the address adder 30 to create a write address to the stack for storing the return address, and the result 
is output to the WW_BUS 76. This write address is output to the D_ADDR_BUS 96 (step S272). The value of the 
program counter stored in the PC 12 is incremented by 2 by the PC incrementer 44 to create the return address, and 

35 the result is output to the D_DATA_BUS 98 over the DOUT signal line 88 (steo S274) . A data write request signal from 
the CPU to the BCU 60 becomes active and executes an external memory write cycle (step S276). In other words, the 
BCU 60 follows the request signal to store this return address in tne stack provided in memory, using this write address 
as a memory address. 

[0132] The value on the WW_BUS 76 is then output to the SP 14 (step S278). In other words, the value of the stack 

40 pointer is updated by being decremented by 4. 

[0133] The program counter stored in the PC 12 is then output to the XA_BUS 78 (step S280). The immediate data 
generator 22 performs a logical shift by one bit to the left on the displacement data 654, subjects it to sign expansion 
to create a 32-bit immediate data displacement imm, then outputs the result to the PB_BUS 74. The address adder 
30 adds the program counter on the XA_BUS 78 and the immediate data displacement imm on the PB_BUS 74 to 

45 create a branch address (ADDR), then outputs that address over the IA signal line 82 to the l_ADDR_BUS 92 (step 
S282). An instruction read request signal from the CPU to the BCU 60 becomes active and executes a read cycle with 
respect to the external memory (ROM) 52 to fetch the object code of the branch destination instruction (step S284). 
[0134] The operation during the execution of the ret instruction will now be described. 

[0135] When the ret instruction is executed, the value of the program counter that was saved to the stack is restored 
so to the PC 12, and the value of the stack pointer stored in the SP 14 is incremented by the word size (4), as described 
with reference to Fig. 4. 

[0136] A flowchart illustrating the operation during the ret instruction is shown in Fig. 16. 

[0137] Before this instruction is executed, the stack pointer stored in the SP 14 indicates the address in the stack 
containing the destination address for return from the called routine. 
55 [01 38] At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 
(step S290). The stack pointer on the XA_BUS 78 is then output to the D_ADDR_BUS 96 (step S292). A data read 
request signal from the CPU to the BCU 60 becomes active and executes an external memory read cycle. In other 
words, the BCU 60 follows the request signal to read the return address from the stack provided in memory, using the 
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stack pointer as a memory address. This return destination address that has been read from the memory (RAM) 50 is 
fetched within the CPU from the D_DATA_BUS 94 over the DIN signal line 86, then is output from the DIN signal line 
86 to the l_ADDR_BUS 92 over the IA signal line 82 (step S294). An instruction read request signal from the CPU to 
the BCU 60 becomes active and executes a read cycle with respect to the external memory (ROM) 52 to fetch the 
5 object code of the return destination instruction (step S296). 

[0139] The value of the stack pointer on the XA_BUS 78 is one input of the address adder 30, and constant data 
(+4) created by the immediate data generator 22 forms the other input of the address adder 30. The constant +4 is 
added to the value of the stack pointer on the XA_BUS 78 by the address adder 30 to create the address of the top 
area of the stack region secured by the return destination routine, and the result is output to the WW_BUS 76 (step 
10 S298). This address on the WW_BUS 76 (the address of the top area of the stack region secured by the return desti- 
nation routine) is then output to the SP 14 (step S300). 

(8) Description of Sequential Push Instruction (pushn)and Sequential Pop Instruction (popn) 

[0140] As described previously, it has recently become common to have a configuration with a large number of 
internal general-purpose registers, particularly in RISC CPUs, to enable processes to run rapidly within the CPU itself, 
without having to access memory. The CPU of this embodiment incorporates 16 general-purpose registers, designed 
to increase the speed of processing. However, if there are so many internal registers, the contents of a large number 
of registers have to be saved during processing that saves and restores registers when an interrupt occurs or a sub- 
routine is called. 

[0141] When the contents of these registers are saved or restored in the prior art, a push instruction for storing the 
contents specified by an address or a pop instruction for fetching the contents of the stack to a register is used. The 
actions of these ordinary push and pop instructions will now be described. 

[0142] The actions performed during the execution of push instructions are shown sch erratically in Figs. 17A and 
1 7B and the actions during the execution of pop instructions are shown schematically in Figs. 1 8A and 1 8B. The actions 
performed when data is transferred between a plurality of general-purpose registers and the stack will now be described, 
with reference to Figs. 17A, 17B, 18A, and 18B. 

[0143] Fig. 17A shows the actions when a 'push RV instruction is executed, to write the contents of a general-purpose 
register R1 to the stack. During the execution of this instruction, the contents oftheSP 14 are updated by the subtraction 
of 4 from the current value (1000 in Fig. 17A is updated to 996). The contents 'a' of the general-purpose register R1 
are written to 996, which is the memory address indicated by the updated stack pointer in the SP 14. 
[0144] Fig. 1 7B shows the actions when a 'push R2' instruction is executed to further write the contents of a general- 
purpose register R2 to the stack. During the execution of this instruction, the contents of the SP 14 are updated by the 
subtraction of 4 from the current value (996 in Fig. 17B is updated to 992). The contents 'b' of the general-purpose 
register R2 are written to 992, which is the memory address indicated by the updated stack pointer in the SP 14. 
[0145] Fig. 18A shows the actions when a 'pop R2' instruction is executed to fetch the contents of the stack to the 
general-purpose register R2. During the execution of this instruction, the contents 'b' stored at the memory address 
992 indicated by the stack pointer in the SP 14 are fetched and stored in the general-purpose register R2. The contents 
of the SP 14 are updated by the addition of 4 to the current value (the pre-execution value of 992 in Fig. 18A is updated 
to 996). 

[0146] Finally, Fig. 18B shows the actions when a 'pop RV is executed to further fetch the contents of the stack to 
the general-purpose register R1. During the execution of this instruction, the contents 'a' stored at the memory address 
996 indicated by the stack pointer in The SP 14 are fetched and stored in the general-purpose register R1. The contents 
of the SP 14 are updated by the addition of 4 to the current value (the pre-execution value of 996 in Fig. 18B is updated 
to 1000). 

[0147] When data is transferred between a plurality of general-purpose registers and tne stack in this example of 
the prior art, it is necessary to execute push or pop instructions a plurality of times. This is because each execution of 
a push or pop instruction can only operate on a single register. 

[0148] Therefore, when processing is performed to save data from a number of registers to the stack, or recover 
data from the stack to a number of registers, the resultant increase in the number of instructions will lead toan increase 
in the size of the object code. The number of execution steps in the program will also increase, leading to delays in 
the execution time or processing of the program. 

[0149] The following instruction set is provided for the CPU of this embodiment 

55 pushn%Rs (18) 
popn%Rd (19) 

Instruction (18) is a sequentialoush instruction created by an assembler. It sequentially pushes the contents of n gen- 
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eral-purpose registers (where n is an integer from 1 to 16) from %Rsto RO onto the stack. Instruction (19) is a sequential 
pop instruction created by an assembler It sequentially pops n items of word data (where n is an integer from 1 to 16) 
from the stack and pushes them into general-purpose registers %Rd to RO. Each of the pushn and popn instructions 
consists of op code and an operand. The operand %Rs of the pushn instruction indicates the register number of Rs 

5 when data is to be written to the stack from registers %Rs to RO. The operand %Rd of the popn instruction indicates 
the register number of Rd when data is to be taken from the stack and written to registers RO to %Rd. 
[0150] The bit structure of the pushn and popn instructions is shown in Fig. 19. The lowermost 4-bit field contains a 
code indicating %Rs or %Rd; any of the 16 general-purpose registers can be specified. When data is to be transferred 
between a special register and the stack, it is transferred via a general-purpose register. 

10 [0151] A block diagram of the hardware configuration required for executing the sequential push instruction (pushn) 
or sequential pop instruction (popn) is shown in Fig. 20. Portions necessary for the description of the sequential push 
instruction (pushn) and sequential pop instruction (popn) are extracted from Fig. 1, and further necessary portions are 
added. Portions that are the same as those in Fig. 1 are given the same reference numbers. In this figure, reference 
number 11 denotes 16 general-purpose registers called R0 to R15. These general-purpose registers 11 transfer data 

15 to and from the D_DATA_BUS 98. A register selection address signal 54 for selecting a register comes from a control 
circuit block 45. The stack pointer is stored in the SP 14. The value in the SP 14 can be output to the D_ADDR_BUS 
96 and the internal address bus (XA_BUS) 78 that is connected to an input of the 32-bit address adder 30. Another 
input of the address adder 30 is connected to an offset signal 24 that is output from the control circuit block 45. An 
output of the address adder 30 is held by a latch (Add_LT) 32, then is further output to the XA_BUS 78 or tne WW_BUS 

20 76. The WW_BUS 76 is connected to the input of the SP 14. There is a 4-bit counter (countx) 46 in the control circuit 
block 45, to count the number of registers involved in the data transfer. The control circuit block 45 is further provided 
with an instruction register (not shown in the figure), which holds the operand %Rs or %Rd of the pushn or popn 
instruction and also outputs control signals in accordance with instructions from the instruction decoder. In this figure, 
reference number 60 denotes a bus control unit (BCU) that controls the input and output of data to and from external 

25 memory (RAM) 50, which includes the stack area and outputs READ and WRITE control signals. 

[0152] The description first concerns the operation during the execution of a sequential push instruction (pushn). 
[01 53] When the sequential push instruction (pushn) is executed, the contents stored in the general-purpose registers 
are sequentially pushed into the stack, starting with the register with register number %Rs to general-purpose register 
RO, as described previously with reference to Fig. 4. 

30 [0154] A flowchart illustrating the operation during the pushn instruction is given in Fig. 21 . 

[0155] At the start of execution of the pushn instruction, the offset of a offset signal 24 is -4. The counter (countx) 
46 is cleared to zero and the stack pointer stored in the SP 14 is output to the XA_BUS 78 (step S100). 
[01 56] The address adder 30 then adds -4 to the value on the XA_BUS 78 and inputs the result to the latch (Add_LT) 
32 (step S1 01). 

35 [0157] The value in the latch (Add_LT) 32 is then outputto the D_ADDR_BUS 96. The difference between the value 
of %Rs held in the lowermost 4 bits of the instruction register and the value in counter (countx) 46 is calculated by the 
control circuit block 45 and output tc the register selection address signal 54. The register selected by this signal 54 
is linked to the D_DATA_BUS 98 and a write is performed with respect to the external memory (RAM) 50 (step S102). 
[0158] In a stepS103, the counter (countx) 46 is compared with %Rs. If they match, the writing of registers to external 

40 memory has been completed and the execution of the pushn instruction ends by the writing of the value in the latch 
(Add_LT) 32 to the SP 14 through the WW_BUS 76 (step S1 04). 

[01 59] If they do not match, the counter (countx) is incremented by 1 , the value in the latch (Add_LT) 32 is output to 
the XA_BUS 78, and the processing from step S101 onward is repeated (step S105). 
[0160] The operation during the execution of a sequential pop instruction (popn) will next be described. 
45 [0161] When a sequential pop instruction (popn) is executed, the contents of the stack are sequentially pushed from 
the general-purpose register R0 to the general-purpose register with register number %Rd, as described above with 
reference to Fig. 4. 

[0162] A flowchart illustrating the operation during the popn instruction is shown in Fig. 22. 
[0163] At the start of execution of the popn instruction, the offset of a offset signal 24 is +4. The counter (countx) 46 
50 is cleared to zero and the stack pointer stored in the SP 14 is output to the XA_BUS 78 and the D_ADDR_BUS 96 
(step S110). 

[0164] The address adder 30 then adds 4 to the value in the XA_BUS 78 and inputs the result to the latch (Add_LT) 
32(stepS111). 

[0165] An external memory read cycle is then performed. The thus-read data is written to the general-purpose reg- 
55 jsters 11 through the D_DATA_BUS 98. During this time, the counter (countx) 46 is output to the register selection 
address signal 54 by the control circuit block 45 (step S112). 

[0166] In a step S1 13, the counter (countx) 46 is compared with %Rs. If they match, the reading from external memory 
to registers has been completed and the execution of the popn instruction ends by the writing of the value in the latch 
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(Add_LT) 32 to the SP 14 through the VWV_BUS 76 (step S114). 

[0167] If they do not match, the counter (countx) is incremented by 1 , the value in the latch (Add J_T) 32 is output to 
the XA_BUS 78 and the D_ADDR_BUS 96, and the processing from step S111 onward is repeated (step S115). 
[0168] Thus the 'pushn %Rs' instruction makes it possible to write data from registers %Rs to R0 to the stack and 

5 the 'popn %Rd' instruction makes it possible to return the necessary number of data items from the stack to registers 
R0 to %Rd. Thus data from registers R3 to R0 can be pushed by the execution of 'pushn %Rs\ for example. 
[0169] A further increase in efficiency is achieved in this case by the addition of a restriction over the way in which 
registers are used. In other words, the saving and restoration of register is particularly necessary when a program 
branches to another routine, such as during interrupt processing or when a subroutine is called. It is preferable that 

10 each routine called in such a case, such as another interrupt routine or subroutine, uses the registers in sequence 
from R0. This ensures that the contents of registers can be efficiently saved or restored by using the pushn instruction 
or popn instruction, without having to include registers from R0 to Rd or Rs that contain data that does not need to be 
saved. All of the registers in this embodiment have the same functions, and there are no other restrictions on the usage 
of registers and the instruction, so that this restriction solves the above problem without causing further problems. 

15 [01 70] Therefore, use of the pushn instruction and popn instruction of this embodiment makes it possible to implement 
efficient saving of data from registers to the stack in memory and the restoration of data from the stack to registers by 
a single instruction of either pushn or popn. This minimizes the object code size and the number of program execution 
steps, and also means that the number of execution cycles can be reduced to a single instruction fetch and the nec- 
essary number of data transfers, enabling a minimization of the number of cycles. This makes it possible to increase 

20 the speed of interrupt processing routines and subroutines. 

[0171] Since the structural elements required for this execution are a count means and a simple sequence controller, 
it is possible to implement a data processing circuit with a small number of gates so that this invention can be applied 
to a one-chip microcomputer. . 

[0172] The above description concerned the configuration required for executing the sequential push instruction 
25 (pushn) and sequential pop instruction (popn) when a dedicated stack pointer register is used as the SP 14, but it 
should be obvious to those skilled in the art that the present invention can also be applied to a structure in which any 
general-purpose register is used as tne stack pointer 

Embodiment 2 

30 

[0173] A block diagram of the hardware of a microcomputer in accordance with this embodiment is shown in Fig. 23. 
[01 74] This microcomputer 2 is a 32-bit microcontroller that comprises a CPU 1 0; ROM 52; RAM 50; a high-frequency 
oscillation circuit 910; a low-frequency oscillation circuit 920; a reset circuit 930; a prescaler940; a 16-bit programmable 
timer 950; a 8-bit programmable timer 960; a clock timer 970; an intelligent DMA 980; a high-speed DMA 990; an 
35 interrupt controller 800; a serial interface 81 0; a bus control unit (BCU) 60; an A/D converter 830; a D/A converter 840; 
an input port 850; an output port 860; and an I/O port 870; as well as various buses 92, 94, 96, and 98 that are con nected 
to these components and various pins 890. 

[0175] The CPU 10 has an SP that is a dedicated stack pointer register, and it decodes and executes the dedicated 
stack pointer instructions as described above. This CPU 10 has the same configuration as that of the previously de- 
40 scribed Embodiment 1 and it functions as a decoding means and execution means. 

[0176] The microcomputer of this embodiment can therefore enable efficient coding and execution of processes that 
act upon the stack pointer, with short instructions. 

[0177] The processes of saving and restoring registers can also be coded efficiently, so that interrupt processing 
and subroutine call/return can be performed rapidly. 
45 [0178] The microcomputer of this invention can be applied to peripheral equipment for personal computers, such as 
printers, or electronic equipment such as portable appliances, for example. Such applications of the invention make it 
possible to provide inexpensive, but sophisticated, electronic equipment that is facilitated by a data processing circuit 
that has a fast processing speed and a highly efficient usage of memory. 

[0179] Note that this invention is not to be taken as being limited to the above described embodiments; it can be 
so embodied in various other ways. 

Claims 

55 1. A data processing circuit comprising: 

a dedicated stack pointer register that is used only for the stack pointer (14); 

decoding means (20) for decoding object code of a group of dedicated stack pointer instructions which have 
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object code specifying said dedicated stack pointer register as an implicit operand and which relate to process- 
ing based on said dedicated stack pointer register, and for outputting a control signal based on said object code, 
execution means (10) for executing said group of dedicated stack pointer instructions based on said control 
signal and the contents of said dedicated stack pointer register; and 

a plurality of registers (11) provided in a contiguous sequence, wherein said group of dedicated stack pointer 
instructions comprises at least one of a sequential push instruction and a sequential pop instruction having 
data for specifying a plurality of registers in the object code thereof; 

said decoding means (20) decodes at least one of said sequential push instruction and said sequential pop 
instruction; and 

said execution means (10) performs at least one of a process of executing a plurality of sequential pushes 
of data from said plurality of registers to a stack provided in memory and a process of executing a plurality of 
sequential pops of data from said stack to said plurality of registers, based on the contents of a memory address 
specified by said dedicated stack pointer register (14) and said data for specifying a plurality of registers, during 
the execution of at least one of said sequential push instruction and said sequential pop instruction; 

characterized in that 

said execution means (10) comprises: 

write means for writing the contents of a given register that is one of said plurality of registers (11) to a stack 
provided in memory (SOB), based on a memory address specified by said dedicated stack pointer register (14); 
number-of-times-written count means (46) for counting the number-of-times-written of said write means; and 
comparison means (45) for comparing said number-of-times-written counted by said count means with the 
value of said data for specifying a plurality of registers; 

wherein said write means comprises: 

write memory address generation means (30, 32) for adding a first input and a second input by an adder, and 
for generating a write memory address for specifying a write destination: 

first input control means (45) for providing control such that the first input of said adder is the contents of said 
dedicated stack pointer register at the start of execution of a sequential dedicated instruction, and is a write 
address generated subsequently by a write address generation means; 

second input control means (45) for outputting an offset used during the writing of one word to said stack to 
the second input of said adder; and 

write means (45) for writing to said stack the contents of a register specified by subtraction processing which 
uses said data for specifying a plurality of registers and said number-of times-written, based on said write 
memory address; 

whereby the writing of the contents of said plurality of registers to said stack and the ending of said writing 
are controlled based on the comparison result of said comparison means. 

A data processing circuit comprising: 

a dedicated stack pointer register that is used only for the stack pointer (14); 

decoding means (20) for decoding object code of a group of dedicated stack pointer instructions which have 
object code specifying said dedicated stack pointer register as an implicit operand and which relate to process- 
ing based on said dedicated stack pointer register, and for outputting a control signal based on said object code; 
execution means (10) for executing said group of dedicated stack pointer instructions based on said control 
signal and the contents of said dedicated stack pointer register, and 
a plurality of registers (11 ) provided in a contiguous sequence, 

wherein said group of dedicated stack pointer instructions comprises at least one of a sequential push in- 
struction and a sequential pop instruction having data for specifying a plurality of registers in the object code thereof; 

said decoding means (20) decodes at least one of said sequential push instruction and said sequential pop 
instruction; and 

said execution means (1 0) performs at least one of a process of executing a plurality of sequential pushes 
of data from said plurality of registers to a stack provided in memory and a process of executing a plurality of 
sequential pops of data from said stack to said plurality of registers, based on the contents of a memory address 
specified by said dedicated stack pointer register (14) and said data for specifying a plurality of registers, during 
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the execution of at least one of 
characterized in that 
said execution means (10) comprises: 

read means for reading the contents of a stack provided in memory based on a memory address specified by 
said dedicated stack pointer register, and storing said contents in a given register of said plurality of registers; 
number-of-times-read count means for counting the number-of-times-read of said read means; and 
comparison means (45) for comparing said number-of-times-read counted by said count means with the value 
of said data for specifying a plurality of registers; 

wherein said read means comprises: 

first input control means (45) for providing control such that the first input of an adder is the contents of said 
dedicated stack pointer register at the start of execution of a sequential dedicated instruction, and is a read 
memory address generated subsequently by a read address generation means; 

second input control means (45) for outputting an offset used during the reading of one word from said stack 
to the second input of said adder; 

read means (45) for reading the contents of said stack based on said read memory address and storing said 
contents in a register specified based on said number-of-times-read; 

whereby the reading of the contents of said stack and the ending of said reading are controlled based on 
the comparison result of said comparison means. 

The data processing circuit as defined in claim 1 or 2, further comprising: 

n general-purpose registers (11) specified by register numbers 0 to n-1; 

wherein object code of at least one of said sequential push instruction and said sequential pop instruction 
comprises a final register number to which one of said register numbers is specified, as said data for specifying a 
plurality of registers; and 

said execution means (10) performs at least one of a process of executing a plurality of sequential pushes 
of data into a stack provided in memory from a plurality of registers starting from register 0 to a register specified 
by said final register number and a process of executing a plurality of sequential pops of data from said stack to 
a plurality of registers starting from register 0 to a register specified by said final register number, based on the 
contents of a memory address specified by said dedicated stack pointer register. 

The data processing circuit as defined in any one of claims 1 to 3, further comprising: 

a program counter register (12) used only for the program counter; 

wherein said group of dedicated stack pointer instructions comprises branch instructions that are an instruc- 
tion for branching to a subroutine and a return instruction from said subroutine; 
said decoding means (20) decodes said branch instructions; 
said execution means (10) comprises: 

means for executing at least one of a process of saving the contents of said program counter register (12) to 
a given second area of said stack provided in memory and a process of restoring the contents of said second 
area to said program counter register (12), based on a memory address specified by said dedicated stack 
pointer register, during the execution of said branch instruction; and 

means for updating the contents of said dedicated stack pointer register (12) based on said saving and res- 
toration. 

The data processing circuit as defined in any one of claims 1 to 4, wherein said circuit uses instructions of a reduced 
instruction set computer. 

The data processing circuit as defined in any one of claims 1 to 5 t wherein: 

a fixed-length instruction is decoded and execution is based on said fixed-length instruction. 



18 



EP 0 809 180 B1 



7. A microcomputer comprising the data processing circuit as defined in any one of claims 1 to 6, storage means, 
and input and output means for inputting data from and for outputting data to external devices. 

8. The microcomputer as defined in claim 7, 

wherein a program that is executed thereby uses a program language that secures a storage region for auto- 
variables by using said stack pointer. 

9. Electronic equipment comprising the microcomputer defined in claim 7 or 8. 
Patentanspruche 

1. Datenverarbeitungsschaltung, aufweisend: 

ein zweckgebundenes Stapelzeigerregister, das nur fur den Stapebeiger (14) benutzt wird; 
eine Dekodiereinrichtung (20) zum Dekodieren von Objektcode einer Gruppe zweckgebundener Stapelzei- 
gerbefehle, die einen Objektcode haben, der das zweckgebundene Stapelzeigerregister als einen implizierten 
Operanden spezifiziert, und sich auf das Verarbeiten aufgrund des zweckgebundenen Stapelzeigerregisters 
beziehen, sowie zum Ausgeben eines Steuersignals aufgrund des Objektcodes, 

eine Ausfuhrungseinrichtung (10) zum Ausfuhren der Gruppe zweckgebundener Stapelzeigerbefehle auf- 
grund des Steuersignals und des Inhalts des zweckgebundenen Stapelzeigerregisters; und 
eine Vielzahl von Registem (11), die in einer aneinandergrenzenden Folge vorgesehen sind, wobei die Gruppe 
zweckgebundener Stapelzeigerbefehle mindestens einen von zwei Befehlen, namlich einen sequentietlen Pu- 
sh-Befehl und einen sequentietlen Pop-Befehl, aufweist, die Daten zum Spezifizieren einer Vielzahl von Re- 
gistern in ihrem Objektcode haben; 

wobei die Dekodiereinrichtung (20) mindestens einen der beiden Befehle, namlich den sequentiellen Push- 
Befehl und den sequentiellen Pop-Befehl, dekodiert; und 

die AusfQhrungseinrichtung (10), wahrend der AusfOhrung mindestens eines der beiden Befehle, des se- 
quentiellen Push-Befehls und des sequentiellen Pop-Befehls, mindestens einen von zwei Prozessen, namlich 
einen ProzeG der AusfOhrung einer Vielzahl sequentieller Pushs von Daten aus der Vielzahl von Registern auf 
einen im Speicher vorgesehenen Stapel und einen Prozefc der AusfOhrung einer Vielzahl sequentieller Pops von. 
Daten von dem Stapel an die Vielzahl von Registern, aufgrund des Inhalts einer von dem zweckgebundenen 
Stapelzeigerregister (14) spezifizierten Speicheradresse und der Daten zum Spezifizieren einer Vielzahl von Re- 
gistern, durchfuhrt; 

dadurch gekennzeichnet, daB die Ausfuhrungseinrichtung (10) folgendes aufweist 

eine Schreibeinrichtung zum Schreiben des Inhalts eines gegebenen Registers, welches eines der Vielzahl 
von Registern (11) ist, in einen im Speicher (SOB) vorgesehenen Stapel aufgrund einer von dem zweckgebun- 
denen Stapelzeigerregister (14) spezifizierten Speicheradresse; 

eine Schreibfrequenz-Zahleinrichtung (46), die zahlt, wie oft die Schreibeinrichtung geschrieben hat; und 
eine Vergleichseinrichtung (45), die die von der Zahleinrichtung gezahlte Schreibfrequenz mit dem Wert der 
Daten zum Spezifizieren einer Vielzahl von Registern vergleicht; 

wobei die Schreibeinrichtung folgendes aufweist: 

eineSchreibspeicheradressen-Generatoreinrichtung (30, 32), die eine erste Eingabe und eine zweite Eingabe 
durch einen Addierer summiert und eine Schreibspeicheradresse generiert, um ein Schreibzielzu spezifizie- 
ren; 

eine erste Eingabesteuereinrichtung (45), die eine derartige Steuerung bietet, daG die erste Eingabe des Ad- 
dierers der lnhalt des zweckgebundenen Stapelzeigerregisters beim Start der AusfOhrung eines sequentiellen 
zweckgebundenen Befehls ist und eine Schreibadresse ist, die anschliefcend von einer Schreibadressenge- 
neratoreinrichtung generiert wird; 

eine zweite Eingabesteuereinrichtung (45), die an den zweiten Eingang des Addierers einen Versatz ausgibt, 
der beim Schreiben eines Wortes an den Stapel benutzt wird; und 

eine Schreibeinrichtung (45), die in den Stapel den lnhalt eines durch Subtraktionsverarbeitung spezifizierten 
Registers schreibt, fur die die Daten zum Spezifizieren einer Vielzahl von Registern und die Schreibfrequenz 
aufgrund der Schreibspeicheradresse benutzt wird; 
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wodurch das Schreiben des tnhalts der Vielzahl von Registern in den Stapel und das Beenden des Schrei- 
bens aufgrund des Vergleichsergebnisses der Vergleichseinrichtung gesteuert werden. 

Datenverarbeitungsschaltung, aufweisend: 

ein zweckgebundenes Stapelzeigerregister, das nur fur den Stapelzeiger (14) benutzt wird; 
eine Dekodiereinrichtung (20) zum Dekodieren von Objektcode einer Gruppe zweckgebundener Stapelzei- 
gerbefehle, die einen Objektcode haben, derdaszweckgebundene Stapelzeigerregister a Is einen implizierten 
Operanden spezifiziert, und sich auf das Verarbeiten aufgrund des zweckgebundenen Stapelzeigerregisters 
beziehen, sowie zum Ausgeben eines Steuersighals aufgrund des Objektcodes, 

eine Ausfuhrungseinrichtung (10) zum Ausfuhren der Gruppe zweckgebundener Stapelzeigerbefehte auf- 
grund des Steuersignals und des Inhalts des zweckgebundenen Stapelzeigerregisters; und 
eine Vielzahl von Registern (11), die in einer aneinandergrenzenden Folge vorgesehen sind, 

wobei die Gruppe zweckgebundener Stapelzeigerbefehle mindestens einen von zwei Befehlen, namlich ei- 
nen sequentiellen Push-Befehl und einen sequentieiien Pop-Befehl, aulweist, die Daten zum spezifizieren einer 
Vielzahl von Registern in ihrem Objektcode haben; 

wobei die Dekodiereinrichtung (20) mindestens einen der beiden Befehle, namlich den sequentiellen Push- 
Befehl und den sequentiellen Pop-Befehl, dekodiert; und 

die Ausfuhrungseinrichtung (10), wahrend der Ausfuhrung mindestens eines der beiden Befehle, des se- 
quentiellen Push-Befehls und des sequentiellen Pop-Befehls, mindestens einen von zwei Prozessen, namlich 
einen ProzeB der Ausfuhrung einer Vielzahl sequentieller Pushs von Daten aus der Vielzahl von Registern auf 
einen im Speicher vorgesehenen Stapel und einen ProzeR der AusfOhrung einer Vielzahl sequentieller Pops von 
Daten von dem Stapel an die Vielzahl von Registern, aufgrund des Inhalts einer von dem zweckgebundenen 
Stapelzeigerregister (14) spezifizierten Speicheradresse und der Daten zum Spezifizieren einer Vielzahl von Re- 
gistern durchfuhrt; 

dadurch gekennzeichnet, daS die Ausfuhrungseinrichtung (10) folgendes aufweist 

eine Leseeinrichtung zum Lesen des Inhalts eines im Speicher vorgesehenen Stapels aufgrund einer von 
dem zweckgebundenen Stapelzeigerregister spezifizierten Speicheradresse und zum Speichern des Inhalts 
in einem gegebenen Register der Vielzahl von Registern; 

eine Lesefrequenz-Zdhleinrichtung, diezShrt, wie oft die Leseeinrichtung gelesen hat; und 

eine Vergleichseinrichtung (45), die die von der Zahleinrichtung gezahlte Lesefrequenz mit dem Wert der 

Daten zum Spezifizieren einer Vielzahl von Registern vergleicht; 

wobei die Leseeinrichtung folgendes aufweist: 

eine erste Eingabesteuereinrichtung (45), die eine derartige Steuerung bietet, daS die erste Eingabe eines 
Addierers der Inhait des zweckgebundenen Stapelzeigerregisters beim Start der AusfOhrung eines sequenti- 
ellen zweckgebundenen Befehls ist und eine Lesespeicheradresse ist, die anschlieUend von einer Leseadres- 
sengeneratoreinrichtung generiert wird; 

eine zweite Eingabesteuereinrichtung (45), die an den zweiten Eingabe des Addierers einen Versatz ausgibt, 
der beim Lesen eines Wortes von dem Stapel benutzt wird; und 

eine Leseeinrichtung (45), die den Inhait des Stapels aufgrund der Lesespeicheradresse liest und den Inhait 
in einem Register speichert, das aufgrund der Lesefrequenz spezifiziert wurde, 

wodurch das Lesen des Inhalts des Stapels und das Beenden des Lesens aufgrund des Vergleichsergeb- 
nisses der Vergleichseinrichtung gesteuert werden. 

Datenverarbeitungsschaltung nach Anspruch 1 oder 2, ferner aufweisend: 

n Mehrzweck-Register(H), die durch Registernummern 0 bis n-1 spezifiziert sind; 

wobei der Objektcode mindestens eines der beiden Befehle, des sequentiellen Push-Befehls und des se- 
quentiellen Pop-Befehls, als Daten zum Spezifizieren einer Vielzahl von Registern eine letzte Registernummer 
umfa&t, auf die eine der Registernummern spezifiziert ist; und 

die Ausfuhrungseinrichtung (10) mindestens einen von zwei Prozessen, namlich einen ProzeG der Ausfuh- 
rung einer Vielzahl sequentieller Pushs von Daten auf einen im Speicher vorgesehenen Stapel von einer Vielzahl 
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von Registern, beginnend mitdem Register 0 bis zu einemvon derletzten Registernummerspezifizierten Register, 
und einen ProzeU zur AusfUhrung einer Vielzahl sequentielier Pops von Daten aus dem Stapel zu einer Vielzahl 
von Registern, beginnend mrt dem Register 0 bis zu einemvon derletzten Registernummerspezifizierten Register, 
aufgrund des In halts einer Speicheradresse durchfuhrt, die von dem zweckgebundenen Stapetzeigerregister spe- 
zifiziert ist. 

4. Datenverarbeitungsschaltung nach einem der Anspruche 1 bis 3, ferner aufweisend: 

ein Programmzahlerregister (12), welches nurfur den Programmzahler verwendet wird; 

wobei die Gruppezweckgebundener Stapelzeigerbefehle Verzweigungsbefehle aufweist, die ein Befehl zum 
Abzweigen zu einer Subroutine und ein RQckkehrbefehl von der Subroutine sind; 
die Dekodiereinrichtung (20) die Verzweigungsbefehle dekodiert; und 
die Ausfuhrungseinrichtung (10) fblgendes aufweist: 



eine Einrichtung zur Ausfuhrung mindestens eines von zwei Prozessen, namlich eines Prozesses der Siche- 
rung des Inhalts des Programmzahlerregisters (12) zu einem gegebenen zweiten Bereich des im Speicher 
vorgesehenen Stapels und eines Prozesses zum Wiederherstellung des Inhalts des zweiten Bereiches in dem 
Programmzahlerregister (12) aufgrund einer von dem zweckgebundenen Stapetzeigerregister spezifizierten 
20 Speicheradresse wahrend der Ausfuhrung des Verzweigungsbefehls; und 

eine Einrichtung zum Aktualisieren des Inhalts des zweckgebundenen Stapelzeigerregisters (12) aufgrund 
der Sicherung und Wiederherstellung. 

5. Datenverarbeitungsschaltung nach einem der Anspruche 1 bis 4, bei der die Schaltung Befehle eines reduzierten 
25 Befehlssatzrechners benutzt 

6. Datenverarbeitungsschaltung nach einem der Anspruche 1 bis 5, bei der ein Befehl fester Lange dekodiert wird 
und die Ausfuhrung auf dem Befehl fester Lange beruht 

30 7. Mikrorechner mit der Datenverarbeitungsschattung nach einem der Anspruche 1 bis 6, einer Speichereinrichtung 
und Eingabe- und Ausgabeeinrichtungen zur Eingabe von Daten von und zur Ausgabe von Daten an externe 
Vorrichtungen. 

8. Mikrorechner nach Anspruch 7, bei dem ein Programm, das von ihm ausgefuhrt wird, eine Programmsprache 
35 benutzt, die eine Speicherzone fur Auto-Variable durch Benutzung des Stapelzeigers sicherstellt 

9. Elektronisches Gerat, welches den-in Anspruch 7 oder 8 beschriebenen Mikrorechner aufweist 

40 Revendications 

1 . Circuit de traitement de donnees comprenant : 

un registre dSdie de pointeur de pile, qui est utilise uniquement pour le pointeur (14) de pile ; 
45 un moyen (20) de d6codage, destine a decoder un code objet d'un groupe destructions dedie de pointeur de 

pile qui comportent un code objet sp^cifiant le registre dedie de pointeur de pile, en tant qu'op6rande implicite 
et qui se rapportent a un traitement fonde sur le registre dedie de pointeur de pile, et destine a delivrer un 
signal de commande, sur la base du code objet, 

un moyen (10) d'execution destine a executer le groupe destructions dedie de pointeur de pile, sur la base 
50 du signal de commande et du contenu du registre dedie de pointeur de pile; et 

une pluralite (11) de registres, prevue dans une suite contigue, dans laquelle le groupe destructions dedie 
de pointeur de pile comprend au moins Tune d'une instruction de chargement sequentiel et d'une instruction 
de dechargement sequentiel comportant des donnees destin6es a specifier une pluralite de registres dans le 
code objet de celie-ci ; 

55 le moyen (20) de decodage decode au moins I'une de I'instruction de chargement sequentiel et de Instruction 

de dechargement sequentiel ; et 

le moyen (10) d'execution effectue au moins Tun d'un traitement consistant a executer une pluralite de char- 
gements sequentiels de donnees, a partir de la pluralite de registres, dans une pile prevue dans la rnemoire, 
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et d'un traitement consistant a executer une plurality de dechargements sequentiels de donnees, a partir de 
la pile, dans la plurality de registres, sur la base du contenu d'une adresse en memoire sp6cifi6e par le registre 
(14) dedte de pointeur de pile et des donnees destinees a specifier une pluralite de registres, pendant I'exe- 
cution d'au moins Tune de ('instruction de chargement sequentiel et de 1'instruction de dechargement 
sequentiel ; 

caracterise en ce que : 

le moyen (10) d'execution comprend : 

un moyen d'ecriture destine a ecrire le contenu d'un registre donne qui est Tun des registres parmi la 
pluralite (11) de registres, dans une pile prevue dans la memoire (SOB), sur la base d'une adresse en 
memoire specifiee par le registre (14) dedie du pointeur de pile; 

un moyen (46) de comptage du nombre de fois d'ecriture, destine a compter le nombre d'ecriture du moyen 
d'ecriture ; et 

un moyen (45) de comparaison destine a comparer le nombre de fois d'ecriture comptee par le moyen 
de comptage, a la valeur des donnees destinees a specifier une plurality de registres ; 

dans lequel le moyen d'ecriture comprend : 

des moyens (30, 32) de production d'adresses d'ecriture en memoire, destines a addttionner une premiere 
entree et une deuxieme entr6e, au moyen d'un additionneur et a produire une adresse d'ecriture en m6moire, 
destinee a specifier une destination d'ecriture ; 

un moyen (45) de commande de premiere entree, destine a deiivrer une commande telle que la premiere 
entree de i'additionneur est le contenu du registre d6di6 du pointeur de pile, au demarrage de ('execution d'une 
instruction sequentielle specialisee, et est une adresse d'ecriture produite ulterieurement par un moyen de 
production d'adresses d'ecriture ; 

un moyen (45) de commande de deuxieme entree, destine a deiivrer un decalage utilise pendant I'ecriture 
d'un mot dans la pile, a la deuxieme entree de I'additionneur ; et 

un moyen (45) d'ecriture destine a 6crire dans la pile le contenu d'un registre sp6cifie, par un traitement de 
soustraction qui utilise les donnees destinees a specifier une pluralite de registres et le nombre de fois d'ecri- 
ture, sur la base de I'adresse d'ecriture en memoire ; 

de sorte que I'ecriture du contenu de la pluralite de registres, dans la pile et I'achevement de I'ecriture sont 
commandes sur la base du resultat de comparaison du moyen de. comparaison. 

Circuit de traitement de donnees comprenant : 

un registre dedie de pointeur de pile qui est utilise uniquement pour le pointeur (14) de pile ; 
un moyen (20) de decodage destine a decoder un code objet d'un groupe d' instructions dediees de pointeur 
de pile qui comportent un code objet specifiant le registre dedie de pointeur de pile, en tant qu'operande 
implicite et qui se rapportent a un traitement fonde sur le registre dedie de pointeur de pile, et destine a deiivrer 
un signal de commande, sur la base du code objet ; 

moyen (10) d'execution destine a executer le groupe destructions dediees de pointeur de pile, sur la base 
du signal de commande et du contenu du registre dedie de pointeur de pile; et 
une pluralite (11) de registres, prevue dans une suite contiguS, 

dans lequel le groupe ^instructions dediees de pointeur de pile comprend au moins rune d'une instruction 
de chargement sequentiel et une instruction de dechargement sequentiel comportant des donnees destinees £ 
specifier une pluralite de registres dans le code objet de celle-ci; 

le moyen (20) de decodage decode au moins I'une de I'instruction de chargement sequentiel et de ('instruction 
de dechargement sequentiel ; et 

le moyen ( 1 0) d'execution effectue au moins Tun d'un traitement consistant a executer une pluralite de char- 
gements sequentiels de donnees, a partir de la pluralite de registres, dans une pile prevue dans la memoire et 
d'un traitement destine a executer une pluralite de dechargements sequentiels de donnees, a partir de la pile, 
dans la pluralite de registres, sur la base du contenu d'une adresse en memoire specifiee par le registre (14) dedie 
du pointeur de pile et des donnees destinees a specifier une pluralite de registres, pendant ('execution d'au moins 
I'une de I'instruction de chargement sequentiel et de I'instruction de dechargement sequentiel; 

caracterise en ce que : 
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le moyen (10) d'execution comprend : 

un moyen de lecture destine a lire le contenu d'une pile, prevue dans la memoire, sur la base d'une adresse 
en memoire specified par le registre dedie du pointeur de pile, et destine a mGmoriser le contenu dans 
un registre donne de la pluralite de registres ; 

un moyen de comptage de nombre de lecture destine a compter le nombre de fois de lecture du moyen 
de lecture ; et 

un moyen (45) de comparison destine a comparer le nombre de lecture compte par le moyen de comp- 
tage, a la valeur des donnees destinees a specifier une pluralite de registres ; 

dans lequel le moyen de lecture comprend: 

un moyen (45) de commande de premiere entree, destine a delivrer une commande telle que la premiere 
entree d'un additionneur est le contenu du registre dedie du pointeur de pile, au demarrage de {'execution 
d'une instruction sequentielle specialised, et est une adresse de lecture en memoire, calculee ult6rieurement 
par un moyen de production d'adresse de lecture ; 

un moyen (45) de commande de deuxieme entree, destine a delivrer un decalage utilise pendant la lecture 
d'un mot a partir de la pile, a la deuxieme entree de ('additionneur ; 

un moyen (45) de lecture destine a lire le contenu de la pile, sur la base de I'adresse de lecture en memoire 
et a memoriser le contenu dans un registre specifie sur la base du nombre de fois de lecture; 
de sorte que la lecture du contenu de la pile et I'achevement de la lecture sont commandes sur la base du 
resultat de comparaison du moyen de companaison. 

Circuit de traitement de donnees, suivant la revendication 1 ou 2 comprenant, en outre : 

n registres (11) non specialises, specifiee par les numeros de registres 0 a n-1 ; 

dans lequel le code objet d'au moins I'une de Instruction de chargement sequentiel et de instruction de 
dechargement s6quentiel, comprend un numero de registre final auquel Tun des numeros de registre est spedifie, 
en tant que donnees destinees a specifier une pluralite de registres ; et 

le moyen (10) d'execution effectue au moins I'un d'un traitement consistant a executer une pluralite de char- 
gements sequentiels de donnees dans une pile prevue dans la memoire, a partir d'une pluralite de registres, du 
registre 0 a un registre specifie par le numero de registre final et d*un traitement consistant a executer une pluralite 
de dechargements sequentiels de donnees, de la pile dans une pluralite de registres, du registre 0 a un registre 
specifie par le numero de registre final, sur la base du contenu d'une adresse en memoire specified par le registre 
dedie de pointeur de pile. 

Circuit de traitement de donnees, tel que defini dans I'une quelconque des revendications 1 a 3 comprenant, en 
outre : 

un registre (12) compteur ordinal utilise seulement pour le compteur ordinal ; 

dans lequel le groupe destructions dediees de pointeur de pile comprend des instructions de branchement 
qui sont une instruction de branchement a un sous-programme et une instruction de retour du sous-programme ; 
le moyen (20) de decodage decode des instructions de branchement ; 
le moyen (10) d'execution comprend : 

un moyen destine a executer au moins un traitement consistant a sauvegarder le contenu du registre (12) 
compteur ordinal dans une deuxieme zone donnee de la pile, prevue dans la memoire ou un traitement con- 
sistant a restaurer le contenu de la deuxieme zone dans le registre (12) compteur ordinal, sur la base d'une 
adresse en memoire specifiee par le registre (12) dedie du pointeur de pile, pendant I'execution de ('instruction 
de branchement ; et 

un moyen destine a mettre a jour le contenu du registre (12) dedie de pointeur de pile, sur la base de la 
sauvegarde et de la restauration. 

Circuit de traitement de donnees, tel que defini suivant I'une quelconque des revendications 1 a 4, dans lequel le 
circuit utilise des instructions d'un ordinateur a jeu destructions reduit 
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6. Circuit de traitement de donnees, tel que defini suivant Tune quelconque des revendi cations 1 a 5, dans lequel : 

une instruction de longueur fixe est decodee et I'execution est fondee sur I'instruction de longueur fixe. 

5 7. Micro-ordinateur comprenant le circuit de traitement de donnees, tel que defini suivant Tune quelconque des re- 
vendications 1 a 6, un moyen de memorisation et des moyens d'entree et de sortie, destines a introduire des 
donnees a partir de dispositifs externes et a leur delivrer des donnees. 

8. Micro-ordinateur tel que defini suivant la revendication 7, 

10 dans lequel un programme qui est execute utilise ainsi un langage de programme qui reserve une zone de 

me moire destinee a des variables automatiques, en utilisant le pointeur de pile. 

9. Materiel electronique comprenant le micro-ordinateur defini a la revendication 7 ou 8. 
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FIG. 3 



Memory 
address 




tOO Stack region 



28 




29 



EP 0 809 180 B1 



FIG. 5 
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FIG. 6A 
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FIG. 8 
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FIG. IOA 
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FIG. 12 
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FIG . 13 
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FIG . 14 
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FIG. 15 
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FIG. 16 
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FIG. I7A 
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FIG. 19 
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FIG. 21 

Flowchart of pushn execution 
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FIG.22 

Flowchart of pop execution 
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